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Abstract 

We present the results of a Bayesian analysis of a Regge model to describe the 
background contribution for K + A and K + YP photoproduction. The model is 
based on the exchange of K + (A94) and if*+(892) trajectories in the i-channel. 
We utilise the Bayesian evidence Z to determine the best model variant for each 
channel. The Bayesian evidence integrals were calculated using the Nested Sam- 
pling algorithm. For different prior widths, we find decisive Bayesian evidence 
(AlnZ w 24) for a K + A photoproduction Regge model with a positive vector 
coupling and a negative tensor coupling constant for the K* + (892) trajectory, 
and a rotating phase factor for both trajectories. Using the \ 2 minimisation 
method, one could not draw this conclusion from the same dataset. For the 
K + YP photoproduction Regge model, on the other hand, the difference between 
the evidence integrals is insufficient to pinpoint one model variant. 
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1. Introduction 

To resolve the structure of the nucleon, many models based on effective 
degrees of freedom have been developed. One of the more popular ones is the 
constituent quark model (CQM), which presents the nucleon as a system of 
three constituent quarks [1]. The CQM, however, predicts far more resonances 
than confirmed by experiment. This may lead one to turn to other models that 
predict fewer resonances [2]. 

Alternatively, this discrepancy may arise because missing resonances do not 
couple to the channels commonly used for nucleon spectroscopy, such as the 
pion-nucleon (ttN) channel. This issue can be addressed by using an electro- 
magnetic probe instead of a pion, and by examining decay channels other than 
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irN. Hence, electromagnetic open strangeness or kaon-hyperon (KY) produc- 
tion is suggested as a key process to seek for unobserved resonances. The analy- 
sis of this process faces other difficulties, such as a small cross section and a high 
threshold [3, 4]. Unlike threshold one-pion photoproduction, which is dominated 
by the A(1232) resonance, the KY reaction channel opens in a resonance-rich 
energy region. The identification of those resonances constitutes a major chal- 
lenge in modelling KY production. A second characteristic of this channel is 
the great importance of the non-resonant or background contributions. Hence, 
a correct determination of the background is crucial for a correct assessment of 
the resonance contributions. 

The earliest studies of the KY production focused on the estimation of 
coupling constants within a single model variant [5, 6]. Only when the emphasis 
came to lie on identifying missing resonances did the focus shift from parameter 
estimation to model comparison. The statistical tools, however, have not been 
adapted to this new objective. The least-squares method in particular has often 
been stretched beyond its limits, being used not only as an optimisation tool, 
but also as a model selection criterion. In this Letter, we present the Bayesian 
evidence computation, a method based on the principles of Bayesian inference, 
as a more robust and well-founded tool for model comparison. 

The outline of this Letter is as follows. The next section introduces the 
Bayesian evidence and the Nested Sampling (NS) algorithm for evidence com- 
putation. The effectiveness of this algorithm is subsequently demonstrated for 
a Regge model in Section 3 and the results are discussed in Section 4. The 
conclusions and outlook are given in Section 5. 



2. Bayesian analysis 

Bayesian analysis is an established tool for model selection in astronomy 
and cosmology, and is gaining momentum in other fields [7, 8]. The potential 
of this method in hadronic physics was recently demonstrated in a Bayesian 
analysis of pentaquark data by Ireland et al. [9], and for parameter estimation 
in effective field theories by Schindler et al. [10]. The quantity of interest for 
model comparison is the Bayesian evidence, which will be derived below. 

One can straightforwardly derive the posterior probability of a model M, 
given a set of experimental data {dk}- Indeed, using Bayes' theorem, one can 
write this probability as 

The quantity P({dk} \M) is referred to as the marginal likelihood or the Bayesian 
evidence (Z). If the model M can be parametrised with a set olm, this proba- 
bility can be written as an integral over all possible values of these parameters. 
This procedure, which is referred to as marginalisation, yields the following 
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Table 1: Jeffreys' scale for the natural logarithms of evidence ratios A In Z = In [11, 12]. 

A In Z < 1 Not worth more than a bare mention 

1< AlnZ < 2.5 Significant 

2.5 < |Aln£| <5 Strong to very strong 

5 < A In Z Decisive 



expression for the Bayesian evidence: 

Z = P({d k }\M) (2) 

= J P{{d k },a M \M)da M (3) 

= / P{{d k }\a M ,M) P(a M \M) da M - (4) 

J " v ' s v ' 

(0 («) 

Eq. (4) states that the Bayesian evidence is the integral of the product of two 
distributions: (i) the probability of the dataset {d k }, given the set of parameters 
olm and the model M, and (ii) the probability of the set of parameters cx.m, 
given the model M. The first factor, P({d k } \ ctM, M), can be identified as the 
likelihood function, C{olm )• Any prior knowledge of the parameters' probability 
distribution before considering the data {d k } is contained in the second factor 
P(olm\M). This distribution, which is indispensable in Bayesian statistics, is 
referred to as the prior distribution ti(cxm)- These two substitutions allow us 
to write the evidence in a more familiar form, 

Z = j C{a M )'K{a M )da.M, (5) 

in which the explicit dependence on {d k } and M is omitted for brevity. 

It is clear that the actual quantity of interest for model comparison is the 
relative probability of a model Ma versus a model Mb, given the available ex- 
perimental data {rffe}. Writing down the probability ratios and subsequently 
applying Bayes' theorem, one can see how the evidence emerges from this ex- 
pression: 



P(M A \{d k }) = P({d k } \M A ) P(M A ) 

P(M B \{d k }) P({d k }\M B ) P(M B ) U 
= for P{M A )=P{M B ). (7) 

Any prior preference for one model over the other can be incorporated by 
the factor p[^j ■ As we have no prior preference for one of the models, we 
can take this value to be one, hence reducing the comparison of two models 
to the calculation of the evidence ratio, which is often referred to as the Bayes 
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factor. The direct relation between Z and a model's probability elucidates the 
term "evidence": if a model has a higher value of Z, there is more evidence 
in favour of this model. In accordance with our intuitive notions, evidence is 
not only based on experimental data, but also on theoretical restrictions that 
are incorporated through the prior distribution. The natural logarithm of the 
evidence ratio can be interpreted qualitatively with the aid of Jeffreys' scale, 
listed in Table 1. 

The analytical form of the likelihood function C{olm) is rarely known and 
a normal distribution is often used to approximate it. Indeed, data points are 
independent and are usually reported to have normally distributed errors. This 
gives rise to a x 2 -distribution for the quantity defined as 



where Oi is the error bar of data point di, and fi{a.M) is the corresponding model 
prediction. The % 2 -distribution can be approximated by a normal distribution 
if the number of degrees of freedom k is sufficiently large. This value is defined 
as k — N — dim(aM); which approximates the number of data points for a 
sufficiently large dataset and low number of free parameters. The expression for 
this limiting distribution is [13]: 



In an analysis based on \ 2 minimisation - which is an approximation to a 
maximum likelihood fit - only the maximum value of C(olm) is considered for 
model selection. A Bayesian approach is more comprehensive, as it evaluates 
the model over its entire parameter space, and takes the prior distribution into 
account. This distinction is illustrated in Fig. 1. 

Determining the evidence of a model is not a straightforward task, because 
it requires the calculation of multidimensional integrals of the type (5). Most 
often, analytical simplifications are not possible and it is key to adopt numerical 
integration techniques that are optimised for the problem at hand. 

Nested Sampling (NS) is a novel integration technique for computing Bayesian 
evidence, developed by Skilling [14, 15]. This technique significantly reduces the 
computational cost of the integral over the model's parameter space by trans- 
forming it into a one-dimensional integral over the prior mass dX = it{aLM)dot.M ■ 
This is accomplished by regarding the prior mass as a monotonically decreasing 
function of the likelihood, A: 



Assuming a normalised prior, we can hence write the evidence as the follow- 
ing integral over C{X), the inverse of X(X), 






(9) 





(10) 




(11) 
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Figure 1: C(a) -7r(a) for two different single-parameter models Ma and Mb- The traditional 
least-squares method would favour Mg, as it only takes into account the maximum value of 
the likelihood. In contrast, a Bayesian approach would favour model Ma, as its evidence, 
given by Eq. (5), is greater. 




ai crj aj 



Figure 2: The Bayesian evidence Z as an integral of the likelihood C over the prior mass X . 
The lower plots illustrate that for a uniform prior, the increasing prior masses X(d) are equal 
to the area or mass { a \ C(a) > d} inside nested iso-likclihood contours. The thick arrows 
indicate the order of integration. 
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as illustrated in Fig. 2. This integral can be approximated by a sum of likeli- 
hoods, weighted with their respective prior mass contribution AX. In NS, this 
sum is computed using Markov chain Monte Carlo methods to sample from the 
parameter space with the constraint > £j. The prior mass of a sample of 
N points with likelihood greater than £j can be estimated by a factor exp — . 
This can be derived from the probability distribution of the largest value tjy of 
a set of N uniformly distributed points in the interval [0, 1] (representing the 
total prior weight): 

P N (t) = Nt N -\ (12) 

Indeed, the estimation value for (\ogt) N is — ^, which after i iterations amounts 
to a sum of — j^. 

Besides efficiently calculating the evidence, NS can also be used for determin- 
ing the posterior distribution and, more importantly, for parameter estimation. 
A more detailed account of the technique as well as implementation examples 
can be found in Skilling's work [15]. 

In the following section, we will present the results of an application of the 
NS technique to a Regge model for K + A and K + T,° production. 



3. Bayesian analysis of a Regge Model 

Regge phenomenology is a powerful tool to economically describe reactions 
at high energies [16]. A Regge model based on the exchange of charged me- 
son Regge trajectories was applied to KY photoproduction by Vanderhaeghen, 
Guidal and Laget [17]. They found the exchange of two i-channel trajecto- 
ries sufficient to successfully describe the cross sections as well as polarisation 
observables in photoproduction of kaons above the resonance region [18]. 

In the Regge-plus-resonance (RPR) description of electromagnetic KY pro- 
duction, developed by Corthals et al., the Regge background is complemented 
with s-channel nucleon (N*) resonances. This hybrid approach ensures a cor- 
rect high-energy behaviour as well as an improved description of the resonance 
region [4, 19, 20]. 

In the case of K + A and K + S° photoproduction, the Regge background can 
be modelled with the exchange of the if" 1 " (494) and if * + (892) trajectories. This 
amplitude is derived from the i-channel Feynman amplitude by replacing the 
Feynman propagator by the respective Regge propagator [4] : 



+ / s 
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T(l + <**(*)) 1 e 
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.(t)-i j 
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(14) 



r(a*.(i)) 

The kaon trajectories are given by [4]: 

a K (t) = 0.70 GcV~ 2 (t-m 2 K ) (15) 
a* K (t) = 1 + 0.85 GeV~ 2 (t-m 2 K ,) , (16) 

and the scale factor s = 1 GeV~ 2 . The so-called sign factor in the Regge 
propagator is reduced to a phase factor of either 1 (constant phase) or e~ ma ^ 
(rotating phase) due to the strong degeneracy of the trajectories. This assump- 
tion is inspired by the structureless high-energy differential cross-section. These 
phases cannot be determined on theoretical grounds. The possibility of the 
K + and K* + trajectories having a constant phase is excluded, as this combi- 
nation gives rise to a photon asymmetry E = 0, which disagrees with the data. 
The remaining three possibilities, namely rotating K + /rotating K* + , rotat- 
ing if + /constant K* + , and constant if + /rotating K* + , will be abbreviated as 
rot. /rot., rot. /est. and est. /rot., respectively. 

Apart from the three choices with regard to the phases, the model has three 
continuous parameters. These are the strong coupling constant gx+Y P 01 the 
K + trajectory and the tensor and vector couplings of the K* + trajectory, 



v,t 

r v,t e 9 K *+Y P ( 7 , 

g k*+ ~ ^ k k+k*+ • (17) 



Here, kk+k*+ is the transition magnetic moment for K* + — > -fK + decay. 

In the following analysis, the Regge background will be constrained by the 
high-energy data, in accordance with the RPR-approach. In this energy region, 
there is a set of 72 data points for the K + A channel, comprising 56 differential 
cross section data points (%) [21], 9 photon asymmetries (£) [22] and 7 recoil 
asymmetries (P) [23]. The database for K + T,° photoproduction at high energies 
is even smaller, with only 48 differential cross section data points [21] and 9 
photon asymmetries (S) [22]. Optimisation of the above-mentioned parameters 
against these data reveals that there are several model variants with comparable 
X 2 values [4]. For example, in both K + A and K + YP photoproduction, the signs 
of G V K , + and G l K , + cannot be established conclusively using the x 2_m ethod [4]. 

The sign and phase ambiguities may not seem important for the Regge model 
itself. For the RPR model, however, an exact determination of the background 
parameters is of major importance, because it affects the extraction of the res- 
onance information. 

Rcf. [20] shows that by comparing the RPR model variants for electromag- 
netic K + A production to photo- and electroproduction data from the resonance 



region, all but one Regge background model can be eliminated. We will show 
that the Bayesian evidence can be used to distinguish among the twelve different 
models that result from the possible sign and phase combinations, using only 
the high-energy dataset. 

4. Results 

The prior n(a) is chosen to be a uniform distribution. Note that under con- 
ditions of highly concentrated likelihood, for which the prior distribution varies 
mildly, the likelihood dominates the shape of the posterior distribution [14]. 
Accordingly, the evidence calculations will not be largely affected by the choice 
with regard to the prior distribution. This means that a uniform distribution 
will lead to results that are similar to those obtained with a Gaussian or any 
other well-behaved distribution. We show that the bulk of the likelihood is 
indeed concentrated at parameter values below 100 by demonstrating that evi- 
dence calculations for prior widths equal to 100 and much greater than 100 yield 
the same results. 

Prior information exists for the coupling constants of the K + Y°p vertices. 
Indeed, the following relations follow from SU(3) symmetry [24]: 

9k+a p = ~ ^ (3 - 2a) g nNN (18) 
9k+s" p = (2« - 1) 9ttNN (19) 

where a = F/(F + D) quantifies the ratio of F-type to D-type coupling and 
g-irNN is the pion-nucleon coupling constant. It is commonly assumed that SU(3) 
symmetry can be broken at the 20% level [25]. Inserting the experimentally 
determined values a = 0.644 and g\ NN j = 14.3 yields the following prior 
ranges for the coupling constants of the K + Y°p vertices [24, 25]: 

-4.5< 9K ^ Ap <-3.0 

0.9< gg^g <1.3. 

V47T 

There are no reliable theoretical constraints for the K* + Y°p vertices [26]. 
We therefore choose a uniform distribution between zero and a value much larger 
than the natural value of one. To justify this choice for the prior interval, a sen- 
sitivity analysis of the evidence ratios is performed by repeating the calculations 
for different prior ranges. 

The results of these calculations for the K + A and K + S° production models 
are displayed in Table 2 and 3 respectively. These tables list the computed 
values of AlnZ = InZ — \nZ max , using a prior width of respectively 100, 1000 
and 10000. Changing the prior width from 100 to 1000 results in a difference 
of less than 5% in the computed values. For a prior width of 10000, the error 
increases significantly due to a reduced sampling efficiency. More importantly, 
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(20) 
(21) 



however, the ranking of the models is not significantly affected. Clearly, the 
effect of the prior width on the relative probabilities of the models is negligible, 
provided that it is large enough to contain the area where most of the likelihood 
is concentrated. 

Tabic 2: Logarithms of the evidence ratios (A In 2 = ln(Z/Z max )) for the twelve model 
variants resulting from phase and sign ambiguities in the two-trajectory Regge model for 



If+A photoproduction. The results are listed in order of decreasing probability for a prior 
width of 100. 



QV 


G* 


K+/K*+ phase 


7T = U(0, ±100) 


7r = (7(0, ±1000) 


7T = (7(0, ±10000) 






rot. / rot. 



























rot. / est. 


-24.30 


± 


0.75 


-24.7 


± 


3.5 


-24 


± 


41 




+ 


rot. / rot. 


-77.23 


± 


0.76 


-77.3 


± 


2.9 


-79 


± 


59 




+ 


rot. / est. 


-387.04 


± 


0.77 


-387.7 


± 


4.2 


-411 


± 


68 




+ 


rot. / est. 


-2366.2 


± 


1.3 


-2374 


± 


17 


-2530 


± 


280 






est. / rot. 


-2870.66 


± 


0.73 


-2871.0 


± 


4.0 


-2890 


± 


81 




+ 


est. / rot. 


-3384.83 


± 


0.83 


-3386.7 


± 


7.6 


-3440 


± 


120 






est. / rot. 


-3475.74 


± 


0.73 


-3478.5 


± 


5.4 


-3526 


± 


81 






rot. / rot. 


-4297.9 


± 


1.6 


-4317 


± 


32 


-4460 


± 


200 




+ 


rot. / rot. 


-4952.1 


± 


1.7 


-4970 


± 


27 


-5180 


± 


230 




+ 


est. / rot. 


-5092.18 


± 


0.72 


-5094.2 


± 


7.8 


-5150 


± 


130 


+ 




rot. / est. 


-19602.3 


± 


4.2 


-19710 


± 


200 


-20430 


± 


440 



Table 3: Logarithms of the evidence ratios (Aln.Z = ln(Z/Z max )) for the twelve model 
variants resulting from phase and sign ambiguities in the two-trajectory Regge model for 
K + T,° photoproduction. The results are listed in order of decreasing probability for a prior 



width of 100. 





G 


K + /K* + phase 


7T — 


(7(0, ±100) 


7T = (7(0, ±1000) 


7T = (7(0, ±10000) 






rot. / est. 























+ 




rot. / est. 


-0.50 


± 


0.42 


-0.42 


± 


0.63 


-0.55 


± 


0.97 


+ 


+ 


est. / rot. 


-0.67 


± 


0.40 


-0.63 


± 


0.63 


-0.53 


± 


0.98 


+ 


+ 


rot. / rot. 


-0.71 


± 


0.43 


-0.65 


± 


0.62 


-0.7 


± 


1.1 




+ 


rot. / rot. 


-0.79 


± 


0.44 


-0.77 


± 


0.68 


-0.68 


± 


0.97 






rot. / rot. 


-0.86 


± 


0.41 


-0.78 


± 


0.67 


-0.73 


± 


0.87 


+ 




rot. / rot. 


-0.88 


± 


0.47 


-0.79 


± 


0.68 


-0.9 


± 


1.1 




+ 


est. / rot. 


-1.01 


± 


0.44 


-1.03 


± 


0.62 


-1.1 


± 


1.1 


+ 




est. / rot. 


-1.22 


± 


0.45 


-1.10 


± 


0.68 


-1.2 


± 


1.1 




+ 


rot. / est. 


-1.23 


± 


0.46 


-1.11 


± 


0.72 


-1.09 


± 


0.98 






est. / rot. 


-1.71 


± 


0.43 


-1.65 


± 


0.65 


-1.71 


± 


0.96 


+ 


+ 


rot. / est. 


-1.97 


± 


0.48 


-1.88 


± 


0.69 


-1.87 


± 


0.99 



Furthermore, the comparison with Jeffreys' scale (Table 1) indicates that the 
p(7, K + )A data exhibit decisive evidence for the model variant with a positive 
vector and a negative tensor coupling constant, and a rotating phase for both 
trajectories. Indeed, the difference in In 2 with the second-best model is around 
24, amply exceeding the value of 5 required for a decisive statement. This 
result resolves the sign and phase ambiguity for K + A photoproduction, which 
previously could not be achieved using high-energy data alone [4]. Moreover, 
the result from this Bayesian analysis is consistent with the previous analysis 
of this particular model, but it did not require an additional analysis with data 
from the resonance region (for which E? , < 3 GcV) . In other words, the Nested 
Sampling method requires less experimental data to reach the same conclusion 
as the x 2 -a n alysis which used a much larger set of K + A photoproduction data. 
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Figure 3: Comparison of the log-likelihood of |G^.„ + | for the two best models. The model 
with the highest evidence is a rot. /rot. (RR) model with a positive estimation value for this 
parameter. The second-best model, rot. /est. (RC), has a negative estimation value. 



Apart from the relative probability of the different models, the Nested Sam- 
pling technique also provides us with an estimation value of the different pa- 
rameters. In our best model, these values are: 

-3.22 ±0.04 
12.47 ±0.14 

-32.19 ±0.50. (22) 

Additional calculations demonstrate that the results do not change signif- 
icantly when the gK+Y°p coupling constant is allowed to deviate up to 40% 
from SU(3) predictions. For example, the values of A In Z for the second and 
third K + A production models are —24.21 ±0.73 and — 77.09±0.70 respectively, 
agreeing with the values found for 20% SU(3) symmetry breaking. The esti- 
mation values for the coupling constants are not affected. We conclude that 
the high-energy p(7, K + )A data support a coupling constant gx+hp compatible 
with a level of SU(3) symmetry breaking of at most 20%. The estimation value 
for gK+Ap deviates 15% from the SU(3) prediction. 

Fig. 3 shows the log-likelihood of the parameter G V K , + , integrated over the 
remaining two parameters, using a uniform prior between -100 and 100. The 
likelihood is determined by the high-energy K + A photoproduction data only. 
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Figure 4: Comparison of the log-likelihood of |G^.„ + | for the four best model variants. In 
contrast with Fig. 3, the model with the highest evidence value does not have the highest peak. 
Although the rot. /est. (RC) models surpass the est. /rot. (CR) and rot. /rot. (RR) variants, 
the difference between the evidences is still too small to make a decisive statement. 



The greater width and height of the peak of the rot. /rot. model variant indicate 
that it should have a greater evidence than the alternative model. 

The results for K + Yj° production, listed in Table 3, are not as clear-cut 
as those for p(j, K + )A. The difference in InZ of the model variants is next 
to negligible. This was to be expected, as a smaller dataset provides fewer 
restraints on the models' parameters. 

Even when the extensive set of resonance-region data is taken into account, 
the experimental data from the proton target alone does not allow us to sin- 
gle out one background model for the K + TP channel [20]. However, a recent 
analysis of the corresponding reaction on the neutron, n(j, K + )T,~ , was able to 
resolve the remaining ambiguity [27]. The models' parameters were converted 
from the proton to the neutron channel using isospin considerations. 

The log-likelihood of the parameter G V K , + for i4T + £° production is shown in 
Fig. 4. Note that despite boasting the maximal likelihood value, the rot. /rot 
model has a lower evidence than the rot. /est. model. 

5. Conclusions and outlook 

Bayesian inference provides us with a promising tool for model comparison. 
We have demonstrated this by using the Nested Sampling algorithm to compute 
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the Bayesian evidence for different model variants of a Rcgge model for KY 
photoproduction. The results of this calculation indicate that there is decisive 
evidence for a K + A production model with G V K , + > 0, G l K , + < 0, and a rotating 
phase factor for both trajectories. This conclusion could not be drawn from the 
high-energy data set by means of the method of \ 2 minimisation. 

For K + T,° production, the differences in evidence are too small to draw a 
decisive conclusion, and supplementary data is required to fully determine the 
background model for this channel. 

The Nested Sampling method has many applications, both for the RPR 
model and for other research. One of these applications is the accurate esti- 
mation of model parameters as well as the elimination of nuisance parameters. 
More importantly, however, this method may provide us with a means to ad- 
dress the missing-resonance problem by calculating the probability of individual 
resonance contributions in a Bayesian framework. This is an approach we intend 
to explore in the near future. 
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